智能论文笔记

Spatio-temporal estimation of wind speed and wind power using machine learning: predictions, uncertainty and technical potential

Federico Amato , Fabian Guignard , Alina Walch , Nahid Mohajeri , Jean-Louis Scartezzini , Mikhail Kanevski

分类：人工智能 | 机器学习

2021-07-29

在过去的几十年中，风产能的增长表明，风能可以促进世界许多地区的能源过渡。对于模型的高度可变和复杂，对风能的时空变化和相关的不确定性的定量与能源计划者高度相关。机器学习已成为执行风速和功率预测的流行工具。但是，现有方法有几个局限性。其中包括（i）在风速数据中不足以考虑时空相关性，（ii）缺乏量化风速预测不确定性及其对风能估算的不确定性的现有方法，以及（iii）焦点在少于小时的频率上。为了克服这些局限性，我们引入了一个框架，以从不规则分布的风速测量值中的常规网格上重建时空场。将数据分解为时间引用的基础函数及其相应的空间分布系数后，后者是使用极端学习机对空间建模的。然后，对模型和预测不确定性的估计及其在风速转化为风能后的传播的估计值，然后将提供对数据分布模式的任何假设。该方法适用于研究瑞士100米轮毂高度的250 x 250平方米的小时风能潜力，为该国提供了其类型的第一个数据集。潜在的风力发电与风力涡轮机安装的可用区域相结合，以估算瑞士风力发电的技术潜力。此处介绍的风力估算代表了计划人员的重要意见，以支持风力发电增加的未来能源系统的设计。

translated by 谷歌翻译

Unsupervised construction of representations for oil wells via Transformers

Alina Rogulina , Nikita Baramiia , Valerii Kornilov , Sergey Petrakov , Alexey Zaytsev

分类：机器学习

2022-12-29

Determining and predicting reservoir formation properties for newly drilled wells represents a significant challenge. One of the variations of these properties evaluation is well-interval similarity. Many methodologies for similarity learning exist: from rule-based approaches to deep neural networks. Recently, articles adopted, e.g. recurrent neural networks to build a similarity model as we deal with sequential data. Such an approach suffers from short-term memory, as it pays more attention to the end of a sequence. Neural network with Transformer architecture instead cast their attention over all sequences to make a decision. To make them more efficient in terms of computational time, we introduce a limited attention mechanism similar to Informer and Performer architectures. We conduct experiments on open datasets with more than 20 wells making our experiments reliable and suitable for industrial usage. The best results were obtained with our adaptation of the Informer variant of Transformer with ROC AUC 0.982. It outperforms classical approaches with ROC AUC 0.824, Recurrent neural networks with ROC AUC 0.934 and straightforward usage of Transformers with ROC AUC 0.961.

translated by 谷歌翻译

MN-DS: A Multilabeled News Dataset for News Articles Hierarchical Classification

Alina Petukhova , Nuno Fachada

分类：自然语言处理 | 人工智能 | 机器学习

2022-12-22

This article presents a dataset of 10,917 news articles with hierarchical news categories collected between January 1st 2019, and December 31st 2019. We manually labelled the articles based on a hierarchical taxonomy with 17 first-level and 109 second-level categories. This dataset can be used to train machine learning models for automatically classifying news articles by topic. This dataset can be helpful for researchers working on news structuring, classification, and predicting future events based on released news.

translated by 谷歌翻译

Beyond SOT: It's Time to Track Multiple Generic Objects at Once

Christoph Mayer , Martin Danelljan , Ming-Hsuan Yang , Vittorio Ferrari , Luc Van Gool , Alina Kuznetsova

分类：计算机视觉

2022-12-22

Generic Object Tracking (GOT) is the problem of tracking target objects, specified by bounding boxes in the first frame of a video. While the task has received much attention in the last decades, researchers have almost exclusively focused on the single object setting. Multi-object GOT benefits from a wider applicability, rendering it more attractive in real-world applications. We attribute the lack of research interest into this problem to the absence of suitable benchmarks. In this work, we introduce a new large-scale GOT benchmark, LaGOT, containing multiple annotated target objects per sequence. Our benchmark allows researchers to tackle key remaining challenges in GOT, aiming to increase robustness and reduce computation through joint tracking of multiple objects simultaneously. Furthermore, we propose a Transformer-based GOT tracker TaMOS capable of joint processing of multiple objects through shared computation. TaMOs achieves a 4x faster run-time in case of 10 concurrent objects compared to tracking each object independently and outperforms existing single object trackers on our new benchmark. Finally, TaMOs achieves highly competitive results on single-object GOT datasets, setting a new state-of-the-art on TrackingNet with a success rate AUC of 84.4%. Our benchmark, code, and trained models will be made publicly available.

translated by 谷歌翻译

Legal Prompt Engineering for Multilingual Legal Judgement Prediction

Dietrich Trautmann , Alina Petrova , Frank Schilder

分类：自然语言处理 | 人工智能

2022-12-05

Legal Prompt Engineering (LPE) or Legal Prompting is a process to guide and assist a large language model (LLM) with performing a natural legal language processing (NLLP) skill. Our goal is to use LPE with LLMs over long legal documents for the Legal Judgement Prediction (LJP) task. We investigate the performance of zero-shot LPE for given facts in case-texts from the European Court of Human Rights (in English) and the Federal Supreme Court of Switzerland (in German, French and Italian). Our results show that zero-shot LPE is better compared to the baselines, but it still falls short compared to current state of the art supervised approaches. Nevertheless, the results are important, since there was 1) no explicit domain-specific data used - so we show that the transfer to the legal domain is possible for general-purpose LLMs, and 2) the LLMs where directly applied without any further training or fine-tuning - which in turn saves immensely in terms of additional computational costs.

translated by 谷歌翻译

Impact of Domain-Adapted Multilingual Neural Machine Translation in the Medical Domain

Miguel Rios , Raluca-Maria Chereji , Alina Secara , Dragos Ciobanu

分类：自然语言处理

2022-12-05

Multilingual Neural Machine Translation (MNMT) models leverage many language pairs during training to improve translation quality for low-resource languages by transferring knowledge from high-resource languages. We study the quality of a domain-adapted MNMT model in the medical domain for English-Romanian with automatic metrics and a human error typology annotation which includes terminology-specific error categories. We compare the out-of-domain MNMT with the in-domain adapted MNMT. The in-domain MNMT model outperforms the out-of-domain MNMT in all measured automatic metrics and produces fewer terminology errors.

translated by 谷歌翻译

Mapping the Ictal-Interictal-Injury Continuum Using Interpretable Machine Learning

Alina Jade Barnett , Zhicheng Guo , Jin Jing , Wendong Ge , Cynthia Rudin , M. Brandon Westover

分类：计算机视觉 | 人工智能 | 机器学习

2022-11-09

IMPORTANCE: An interpretable machine learning model can provide faithful explanations of each prediction and yet maintain higher performance than its black box counterpart. OBJECTIVE: To design an interpretable machine learning model which accurately predicts EEG protopatterns while providing an explanation of its predictions with assistance of a specialized GUI. To map the cEEG latent features to a 2D space in order to visualize the ictal-interictal-injury continuum and gain insight into its high-dimensional structure. DESIGN, SETTING, AND PARTICIPANTS: 50,697 50-second cEEG samples from 2,711 ICU patients collected between July 2006 and March 2020 at Massachusetts General Hospital. Samples were labeled as one of 6 EEG activities by domain experts, with 124 different experts providing annotations. MAIN OUTCOMES AND MEASURES: Our neural network is interpretable because it uses case-based reasoning: it compares a new EEG reading to a set of learned prototypical EEG samples from the training dataset. Interpretability was measured with task-specific neighborhood agreement statistics. Discriminatory performance was evaluated with AUROC and AUPRC. RESULTS: The model achieves AUROCs of 0.87, 0.93, 0.96, 0.92, 0.93, 0.80 for classes Seizure, LPD, GPD, LRDA, GRDA, Other respectively. This performance is statistically significantly higher than that of the corresponding uninterpretable (black box) model with p<0.0001. Videos of the ictal-interictal-injury continuum are provided. CONCLUSION AND RELEVANCE: Our interpretable model and GUI can act as a reference for practitioners who work with cEEG patterns. We can now better understand the relationships between different types of cEEG patterns. In the future, this system may allow for targeted intervention and training in clinical settings. It could also be used for re-confirming or providing additional information for diagnostics.

translated by 谷歌翻译

Direct Speech Translation for Automatic Subtitling

Sara Papi , Marco Gaido , Alina Karakanta , Mauro Cettolo , Matteo Negri , Marco Turchi

分类：自然语言处理

2022-09-27

自动副标题是将视听产品的语音自动转化为短文本的任务，换句话说，字幕及其相应的时间戳。生成的字幕需要符合多个空间和时间要求（长度，阅读速度），同时与语音同步并以促进理解的方式进行分割。鉴于其相当大的复杂性，迄今为止，通过分别处理转录，翻译，分割为字幕并预测时间戳的元素来解决自动字幕。在本文中，我们提出了第一个直接自动字幕模型，该模型在单个解决方案中从源语音中生成目标语言字幕及其时间戳。与经过内外数据和外域数据训练的最先进的级联模型的比较表明，我们的系统提供了高质量的字幕，同时在整合性方面也具有竞争力，并具有维护单个模型的所有优势。

translated by 谷歌翻译

Dodging the Data Bottleneck: Automatic Subtitling with Automatically Segmented ST Corpora

Sara Papi , Alina Karakanta , Matteo Negri , Marco Turchi

分类：自然语言处理

2022-09-21

字幕（替代）的语音翻译是通过将符合特定显示指南的字幕划分插入字幕中断，将语音数据自动转化为良好的字幕。与语音翻译（ST）类似，模型训练需要并行数据，其中包括音频输入与其文本翻译配对。然而，在替代方面，还必须用字幕断裂来注释文本。到目前为止，这一要求代表了系统开发的瓶颈，如公开可用的替代公司所证实。为了填补这一空白，我们提出了一种在不干预的情况下将现有的ST Corpora转换为替代资源的方法。我们构建了一个分段模型，该模型通过以多模式的方式利用音频和文本来自动将文本片段分为适当的字幕，从而在零拍摄条件下实现了高分子的质量。对手动和自动分割培训的替代系统的比较实验导致相似的性能，显示了我们方法的有效性。

translated by 谷歌翻译

Histogram Layers for Synthetic Aperture Sonar Imagery

Joshua Peeples , Alina Zare , Jeffrey Dale , James Keller

分类：计算机视觉 | 人工智能 | 机器学习

2022-09-08

合成孔径声纳（SAS）图像对于多种应用至关重要，包括目标识别和环境分割。深度学习模型在SAS分析中取得了很大的成功。但是，这些方法提取的功能可能不适合捕获某些纹理信息。为了解决这个问题，我们介绍了直方图在SAS图像上的新应用。在深度学习模型中添加直方图层，通过合并合成和现实世界数据集的统计纹理信息来改善性能。

translated by 谷歌翻译